Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 120.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 3 |
race is highly imbalanced (72.1%) | Imbalance |
age has unique values | Unique |
capital-gain has unique values | Unique |
workclass has 817 (8.2%) zeros | Zeros |
education has 255 (2.5%) zeros | Zeros |
marital-status has 1478 (14.8%) zeros | Zeros |
occupation has 1276 (12.8%) zeros | Zeros |
relationship has 4620 (46.2%) zeros | Zeros |
native-country has 134 (1.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-12-07 22:54:39.896915 |
|---|---|
| Analysis finished | 2023-12-07 22:54:55.275328 |
| Duration | 15.38 seconds |
| Software version | ydata-profiling vv4.0.0 |
| Download configuration | config.json |
age
Real number (ℝ)
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.80156 |
| Minimum | 13.253723 |
|---|---|
| Maximum | 105.02939 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 13.253723 |
|---|---|
| 5-th percentile | 17.692909 |
| Q1 | 26.480108 |
| median | 34.329525 |
| Q3 | 42.801784 |
| 95-th percentile | 60.73332 |
| Maximum | 105.02939 |
| Range | 91.775667 |
| Interquartile range (IQR) | 16.321676 |
Descriptive statistics
| Standard deviation | 13.234366 |
|---|---|
| Coefficient of variation (CV) | 0.36965893 |
| Kurtosis | 1.5652641 |
| Mean | 35.80156 |
| Median Absolute Deviation (MAD) | 8.1143218 |
| Skewness | 1.0068311 |
| Sum | 358015.6 |
| Variance | 175.14845 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38.42623395 | 1 | < 0.1% |
| 33.47605882 | 1 | < 0.1% |
| 51.49675285 | 1 | < 0.1% |
| 52.34532394 | 1 | < 0.1% |
| 31.05434401 | 1 | < 0.1% |
| 15.03968694 | 1 | < 0.1% |
| 18.89680623 | 1 | < 0.1% |
| 33.4505458 | 1 | < 0.1% |
| 34.1117228 | 1 | < 0.1% |
| 16.33749839 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| 13.253723 | 1 | |
| 13.31528595 | 1 | |
| 13.61147563 | 1 | |
| 13.63157115 | 1 | |
| 13.67428287 | 1 | |
| 13.7243673 | 1 | |
| 13.77827064 | 1 | |
| 13.7895904 | 1 | |
| 13.79679007 | 1 | |
| 13.7999897 | 1 |
| Value | Count | Frequency (%) |
| 105.0293905 | 1 | |
| 102.1843329 | 1 | |
| 99.80568432 | 1 | |
| 96.31696959 | 1 | |
| 94.26557406 | 1 | |
| 93.88988886 | 1 | |
| 93.38837156 | 1 | |
| 92.64809681 | 1 | |
| 91.80377173 | 1 | |
| 91.41977369 | 1 |
workclass
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6691 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 817 |
| Zeros (%) | 8.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.6738236 |
|---|---|
| Coefficient of variation (CV) | 0.45619459 |
| Kurtosis | 0.32634182 |
| Mean | 3.6691 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.5695488 |
| Sum | 36691 |
| Variance | 2.8016854 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 6042 | |
| 2 | 934 | 9.3% |
| 6 | 910 | 9.1% |
| 0 | 817 | 8.2% |
| 1 | 514 | 5.1% |
| 7 | 384 | 3.8% |
| 5 | 368 | 3.7% |
| 3 | 19 | 0.2% |
| 8 | 12 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 817 | 8.2% |
| 1 | 514 | 5.1% |
| 2 | 934 | 9.3% |
| 3 | 19 | 0.2% |
| 4 | 6042 | |
| 5 | 368 | 3.7% |
| 6 | 910 | 9.1% |
| 7 | 384 | 3.8% |
| 8 | 12 | 0.1% |
| Value | Count | Frequency (%) |
| 8 | 12 | 0.1% |
| 7 | 384 | 3.8% |
| 6 | 910 | 9.1% |
| 5 | 368 | 3.7% |
| 4 | 6042 | |
| 3 | 19 | 0.2% |
| 2 | 934 | 9.3% |
| 1 | 514 | 5.1% |
| 0 | 817 | 8.2% |
fnlwgt
Real number (ℝ)
| Distinct | 9999 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 190645.2 |
| Minimum | -4265.5312 |
|---|---|
| Maximum | 1152958.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 8 |
| Negative (%) | 0.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -4265.5312 |
|---|---|
| 5-th percentile | 42648.068 |
| Q1 | 123351.4 |
| median | 178107.7 |
| Q3 | 235137.46 |
| 95-th percentile | 399321.48 |
| Maximum | 1152958.3 |
| Range | 1157223.8 |
| Interquartile range (IQR) | 111786.06 |
Descriptive statistics
| Standard deviation | 106585.54 |
|---|---|
| Coefficient of variation (CV) | 0.55907802 |
| Kurtosis | 5.1603199 |
| Mean | 190645.2 |
| Median Absolute Deviation (MAD) | 55963.151 |
| Skewness | 1.429964 |
| Sum | 1.906452 × 109 |
| Variance | 1.1360478 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 237043.9545 | 2 | < 0.1% |
| 33512.99359 | 1 | < 0.1% |
| 330847.5676 | 1 | < 0.1% |
| 225040.2482 | 1 | < 0.1% |
| 306611.0177 | 1 | < 0.1% |
| 218952.3342 | 1 | < 0.1% |
| 203691.7827 | 1 | < 0.1% |
| 128932.4123 | 1 | < 0.1% |
| 30593.14703 | 1 | < 0.1% |
| 241476.6135 | 1 | < 0.1% |
| Other values (9989) | 9989 |
| Value | Count | Frequency (%) |
| -4265.531207 | 1 | |
| -3634.933404 | 1 | |
| -2370.857371 | 1 | |
| -1769.417418 | 1 | |
| -1755.846067 | 1 | |
| -1286.695882 | 1 | |
| -913.9393712 | 1 | |
| -623.3555064 | 1 | |
| 245.6076511 | 1 | |
| 1275.659092 | 1 |
| Value | Count | Frequency (%) |
| 1152958.258 | 1 | |
| 1087774.398 | 1 | |
| 1032063.775 | 1 | |
| 999532.3943 | 1 | |
| 978093.5148 | 1 | |
| 959298.0064 | 1 | |
| 941990.6443 | 1 | |
| 941153.9102 | 1 | |
| 910806.9286 | 1 | |
| 881025.7012 | 1 |
education
Real number (ℝ)
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.8822 |
| Minimum | 0 |
|---|---|
| Maximum | 15 |
| Zeros | 255 |
| Zeros (%) | 2.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 11 |
| Q3 | 11 |
| 95-th percentile | 15 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.8729089 |
|---|---|
| Coefficient of variation (CV) | 0.39190756 |
| Kurtosis | 0.50351297 |
| Mean | 9.8822 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.93598383 |
| Sum | 98822 |
| Variance | 14.999423 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 3890 | |
| 15 | 1587 | |
| 9 | 1160 | 11.6% |
| 1 | 485 | 4.9% |
| 12 | 459 | 4.6% |
| 8 | 457 | 4.6% |
| 7 | 386 | 3.9% |
| 0 | 255 | 2.5% |
| 10 | 253 | 2.5% |
| 5 | 233 | 2.3% |
| Other values (6) | 835 | 8.3% |
| Value | Count | Frequency (%) |
| 0 | 255 | 2.5% |
| 1 | 485 | |
| 2 | 159 | 1.6% |
| 3 | 109 | 1.1% |
| 4 | 171 | 1.7% |
| 5 | 233 | 2.3% |
| 6 | 135 | 1.4% |
| 7 | 386 | 3.9% |
| 8 | 457 | 4.6% |
| 9 | 1160 |
| Value | Count | Frequency (%) |
| 15 | 1587 | |
| 14 | 209 | 2.1% |
| 13 | 52 | 0.5% |
| 12 | 459 | 4.6% |
| 11 | 3890 | |
| 10 | 253 | 2.5% |
| 9 | 1160 | 11.6% |
| 8 | 457 | 4.6% |
| 7 | 386 | 3.9% |
| 6 | 135 | 1.4% |
education-num
Real number (ℝ)
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.8858 |
| Minimum | 1 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 9 |
| median | 10 |
| Q3 | 12 |
| 95-th percentile | 14 |
| Maximum | 16 |
| Range | 15 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.0730283 |
|---|---|
| Coefficient of variation (CV) | 0.31085276 |
| Kurtosis | 0.14830832 |
| Mean | 9.8858 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.45669183 |
| Sum | 98858 |
| Variance | 9.4435027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 2494 | |
| 10 | 2328 | |
| 14 | 1111 | |
| 13 | 960 | 9.6% |
| 12 | 457 | 4.6% |
| 4 | 430 | 4.3% |
| 11 | 428 | 4.3% |
| 7 | 385 | 3.9% |
| 6 | 273 | 2.7% |
| 5 | 266 | 2.7% |
| Other values (6) | 868 | 8.7% |
| Value | Count | Frequency (%) |
| 1 | 49 | 0.5% |
| 2 | 132 | 1.3% |
| 3 | 194 | 1.9% |
| 4 | 430 | 4.3% |
| 5 | 266 | 2.7% |
| 6 | 273 | 2.7% |
| 7 | 385 | 3.9% |
| 8 | 136 | 1.4% |
| 9 | 2494 | |
| 10 | 2328 |
| Value | Count | Frequency (%) |
| 16 | 185 | 1.8% |
| 15 | 172 | 1.7% |
| 14 | 1111 | |
| 13 | 960 | 9.6% |
| 12 | 457 | 4.6% |
| 11 | 428 | 4.3% |
| 10 | 2328 | |
| 9 | 2494 | |
| 8 | 136 | 1.4% |
| 7 | 385 | 3.9% |
marital-status
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6315 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1478 |
| Zeros (%) | 14.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5416048 |
|---|---|
| Coefficient of variation (CV) | 0.5858274 |
| Kurtosis | -0.61040044 |
| Mean | 2.6315 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.079193128 |
| Sum | 26315 |
| Variance | 2.3765454 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 4222 | |
| 4 | 3449 | |
| 0 | 1478 | 14.8% |
| 6 | 328 | 3.3% |
| 5 | 284 | 2.8% |
| 3 | 224 | 2.2% |
| 1 | 15 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1478 | 14.8% |
| 1 | 15 | 0.1% |
| 2 | 4222 | |
| 3 | 224 | 2.2% |
| 4 | 3449 | |
| 5 | 284 | 2.8% |
| 6 | 328 | 3.3% |
| Value | Count | Frequency (%) |
| 6 | 328 | 3.3% |
| 5 | 284 | 2.8% |
| 4 | 3449 | |
| 3 | 224 | 2.2% |
| 2 | 4222 | |
| 1 | 15 | 0.1% |
| 0 | 1478 | 14.8% |
occupation
Real number (ℝ)
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9165 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 1276 |
| Zeros (%) | 12.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 5 |
| Q3 | 10 |
| 95-th percentile | 13 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.2955993 |
|---|---|
| Coefficient of variation (CV) | 0.72603723 |
| Kurtosis | -1.2014529 |
| Mean | 5.9165 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.22249408 |
| Sum | 59165 |
| Variance | 18.452173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1867 | |
| 0 | 1276 | |
| 8 | 1120 | |
| 10 | 1119 | |
| 12 | 986 | |
| 1 | 888 | |
| 7 | 644 | 6.4% |
| 4 | 619 | 6.2% |
| 14 | 386 | 3.9% |
| 5 | 373 | 3.7% |
| Other values (5) | 722 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 1276 | |
| 1 | 888 | |
| 2 | 25 | 0.2% |
| 3 | 1867 | |
| 4 | 619 | 6.2% |
| 5 | 373 | 3.7% |
| 6 | 300 | 3.0% |
| 7 | 644 | 6.4% |
| 8 | 1120 | |
| 9 | 68 | 0.7% |
| Value | Count | Frequency (%) |
| 14 | 386 | 3.9% |
| 13 | 180 | 1.8% |
| 12 | 986 | |
| 11 | 149 | 1.5% |
| 10 | 1119 | |
| 9 | 68 | 0.7% |
| 8 | 1120 | |
| 7 | 644 | |
| 6 | 300 | 3.0% |
| 5 | 373 | 3.7% |
relationship
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4924 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 4620 |
| Zeros (%) | 46.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.6974777 |
|---|---|
| Coefficient of variation (CV) | 1.1374147 |
| Kurtosis | -1.0521511 |
| Mean | 1.4924 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.6683013 |
| Sum | 14924 |
| Variance | 2.8814304 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4620 | |
| 3 | 1672 | 16.7% |
| 1 | 1537 | 15.4% |
| 4 | 1194 | 11.9% |
| 5 | 547 | 5.5% |
| 2 | 430 | 4.3% |
| Value | Count | Frequency (%) |
| 0 | 4620 | |
| 1 | 1537 | 15.4% |
| 2 | 430 | 4.3% |
| 3 | 1672 | 16.7% |
| 4 | 1194 | 11.9% |
| 5 | 547 | 5.5% |
| Value | Count | Frequency (%) |
| 5 | 547 | 5.5% |
| 4 | 1194 | 11.9% |
| 3 | 1672 | 16.7% |
| 2 | 430 | 4.3% |
| 1 | 1537 | 15.4% |
| 0 | 4620 |
race
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| 4 | |
|---|---|
| 2 | 623 |
| 1 | 297 |
| 3 | 87 |
| 0 | 57 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 4 | 8936 | |
| 2 | 623 | 6.2% |
| 1 | 297 | 3.0% |
| 3 | 87 | 0.9% |
| 0 | 57 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4 | 8936 | |
| 2 | 623 | 6.2% |
| 1 | 297 | 3.0% |
| 3 | 87 | 0.9% |
| 0 | 57 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 8936 | |
| 2 | 623 | 6.2% |
| 1 | 297 | 3.0% |
| 3 | 87 | 0.9% |
| 0 | 57 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 8936 | |
| 2 | 623 | 6.2% |
| 1 | 297 | 3.0% |
| 3 | 87 | 0.9% |
| 0 | 57 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 8936 | |
| 2 | 623 | 6.2% |
| 1 | 297 | 3.0% |
| 3 | 87 | 0.9% |
| 0 | 57 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 8936 | |
| 2 | 623 | 6.2% |
| 1 | 297 | 3.0% |
| 3 | 87 | 0.9% |
| 0 | 57 | 0.6% |
sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 5955 | |
| 0 | 4045 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 5955 | |
| 0 | 4045 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5955 | |
| 0 | 4045 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5955 | |
| 0 | 4045 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5955 | |
| 0 | 4045 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5955 | |
| 0 | 4045 |
capital-gain
Real number (ℝ)
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2682.5949 |
| Minimum | -1913.2347 |
|---|---|
| Maximum | 123074.37 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4638 |
| Negative (%) | 46.4% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -1913.2347 |
|---|---|
| 5-th percentile | -113.97831 |
| Q1 | -56.686392 |
| median | 10.166255 |
| Q3 | 94.483273 |
| 95-th percentile | 13169.569 |
| Maximum | 123074.37 |
| Range | 124987.6 |
| Interquartile range (IQR) | 151.16967 |
Descriptive statistics
| Standard deviation | 11806.093 |
|---|---|
| Coefficient of variation (CV) | 4.4009973 |
| Kurtosis | 60.398078 |
| Mean | 2682.5949 |
| Median Absolute Deviation (MAD) | 73.451276 |
| Skewness | 7.527322 |
| Sum | 26825949 |
| Variance | 1.3938383 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19.71298691 | 1 | < 0.1% |
| 114.9299012 | 1 | < 0.1% |
| 842.9985567 | 1 | < 0.1% |
| 101.7188548 | 1 | < 0.1% |
| 99.33068808 | 1 | < 0.1% |
| 39.69315116 | 1 | < 0.1% |
| -80.51026183 | 1 | < 0.1% |
| 84277.1112 | 1 | < 0.1% |
| 67.51765831 | 1 | < 0.1% |
| 12290.59791 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| -1913.234698 | 1 | |
| -1854.802914 | 1 | |
| -1822.593837 | 1 | |
| -1611.872358 | 1 | |
| -1497.689087 | 1 | |
| -1473.581214 | 1 | |
| -1435.438832 | 1 | |
| -1429.901932 | 1 | |
| -1403.61311 | 1 | |
| -1238.173059 | 1 |
| Value | Count | Frequency (%) |
| 123074.3699 | 1 | |
| 122920.4687 | 1 | |
| 121824.594 | 1 | |
| 121372.3405 | 1 | |
| 120794.2426 | 1 | |
| 119942.4656 | 1 | |
| 119804.6572 | 1 | |
| 119459.6174 | 1 | |
| 118995.5597 | 1 | |
| 118528.1543 | 1 |
capital-loss
Real number (ℝ)
| Distinct | 9997 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.701187 |
| Minimum | -7.4920681 |
|---|---|
| Maximum | 3111.6087 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 6610 |
| Negative (%) | 66.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -7.4920681 |
|---|---|
| 5-th percentile | -6.4831587 |
| Q1 | -4.5091744 |
| median | -1.9732618 |
| Q3 | 1.2530795 |
| 95-th percentile | 5.6527159 |
| Maximum | 3111.6087 |
| Range | 3119.1007 |
| Interquartile range (IQR) | 5.7622538 |
Descriptive statistics
| Standard deviation | 303.11376 |
|---|---|
| Coefficient of variation (CV) | 5.9784352 |
| Kurtosis | 35.10283 |
| Mean | 50.701187 |
| Median Absolute Deviation (MAD) | 2.8224199 |
| Skewness | 5.9180979 |
| Sum | 507011.87 |
| Variance | 91877.951 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -5.344715468 | 2 | < 0.1% |
| 4.514324008 | 2 | < 0.1% |
| 0.6323548472 | 2 | < 0.1% |
| -3.549109534 | 1 | < 0.1% |
| 1.597165456 | 1 | < 0.1% |
| -5.098919567 | 1 | < 0.1% |
| 5.927915521 | 1 | < 0.1% |
| -6.904283028 | 1 | < 0.1% |
| -2.679940538 | 1 | < 0.1% |
| -6.219695965 | 1 | < 0.1% |
| Other values (9987) | 9987 |
| Value | Count | Frequency (%) |
| -7.492068079 | 1 | |
| -7.486674418 | 1 | |
| -7.46900707 | 1 | |
| -7.466165568 | 1 | |
| -7.461362444 | 1 | |
| -7.460763249 | 1 | |
| -7.45493261 | 1 | |
| -7.453376983 | 1 | |
| -7.443305506 | 1 | |
| -7.427548289 | 1 |
| Value | Count | Frequency (%) |
| 3111.608651 | 1 | |
| 3094.773145 | 1 | |
| 3055.782902 | 1 | |
| 2867.454631 | 1 | |
| 2790.67168 | 1 | |
| 2720.418574 | 1 | |
| 2673.17814 | 1 | |
| 2661.757182 | 1 | |
| 2649.753209 | 1 | |
| 2597.935411 | 1 |
hours-per-week
Real number (ℝ)
| Distinct | 9999 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.31361 |
| Minimum | -7.7514182 |
|---|---|
| Maximum | 117.37886 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 151 |
| Negative (%) | 1.5% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -7.7514182 |
|---|---|
| 5-th percentile | 12.550952 |
| Q1 | 39.74206 |
| median | 39.926922 |
| Q3 | 40.154293 |
| 95-th percentile | 55.982556 |
| Maximum | 117.37886 |
| Range | 125.13027 |
| Interquartile range (IQR) | 0.4122338 |
Descriptive statistics
| Standard deviation | 11.374256 |
|---|---|
| Coefficient of variation (CV) | 0.29687248 |
| Kurtosis | 5.9102673 |
| Mean | 38.31361 |
| Median Absolute Deviation (MAD) | 0.20142734 |
| Skewness | -0.80484202 |
| Sum | 383136.1 |
| Variance | 129.37371 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39.8181023 | 2 | < 0.1% |
| 39.97167261 | 1 | < 0.1% |
| 35.41662975 | 1 | < 0.1% |
| 39.8712364 | 1 | < 0.1% |
| 39.93698594 | 1 | < 0.1% |
| 22.67485279 | 1 | < 0.1% |
| 39.8326533 | 1 | < 0.1% |
| 21.06893913 | 1 | < 0.1% |
| 40.07141854 | 1 | < 0.1% |
| 40.03405905 | 1 | < 0.1% |
| Other values (9989) | 9989 |
| Value | Count | Frequency (%) |
| -7.75141821 | 1 | |
| -7.68790913 | 1 | |
| -7.549939844 | 1 | |
| -7.491037189 | 1 | |
| -7.370022847 | 1 | |
| -7.369076955 | 1 | |
| -7.293413502 | 1 | |
| -7.114686207 | 1 | |
| -7.074182167 | 1 | |
| -7.048911727 | 1 |
| Value | Count | Frequency (%) |
| 117.3788564 | 1 | |
| 111.963234 | 1 | |
| 108.3094824 | 1 | |
| 108.1831096 | 1 | |
| 107.7032071 | 1 | |
| 105.4779893 | 1 | |
| 104.4486134 | 1 | |
| 103.3344671 | 1 | |
| 102.59429 | 1 | |
| 102.1703515 | 1 |
native-country
Real number (ℝ)
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.8924 |
| Minimum | 0 |
|---|---|
| Maximum | 41 |
| Zeros | 134 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 39 |
| median | 39 |
| Q3 | 39 |
| 95-th percentile | 39 |
| Maximum | 41 |
| Range | 41 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.1594656 |
|---|---|
| Coefficient of variation (CV) | 0.19406343 |
| Kurtosis | 14.339089 |
| Mean | 36.8924 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -3.8107823 |
| Sum | 368924 |
| Variance | 51.257948 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39 | 8882 | |
| 26 | 282 | 2.8% |
| 0 | 134 | 1.3% |
| 30 | 87 | 0.9% |
| 8 | 32 | 0.3% |
| 19 | 30 | 0.3% |
| 35 | 27 | 0.3% |
| 22 | 24 | 0.2% |
| 33 | 22 | 0.2% |
| 23 | 21 | 0.2% |
| Other values (32) | 459 | 4.6% |
| Value | Count | Frequency (%) |
| 0 | 134 | |
| 1 | 14 | 0.1% |
| 2 | 13 | 0.1% |
| 3 | 21 | 0.2% |
| 4 | 10 | 0.1% |
| 5 | 20 | 0.2% |
| 6 | 17 | 0.2% |
| 7 | 18 | 0.2% |
| 8 | 32 | 0.3% |
| 9 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 41 | 16 | 0.2% |
| 40 | 18 | 0.2% |
| 39 | 8882 | |
| 38 | 19 | 0.2% |
| 37 | 14 | 0.1% |
| 36 | 14 | 0.1% |
| 35 | 27 | 0.3% |
| 34 | 15 | 0.1% |
| 33 | 22 | 0.2% |
| 32 | 12 | 0.1% |
target
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 8349 | |
| 0 | 1651 | 16.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 8349 | |
| 0 | 1651 | 16.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 8349 | |
| 0 | 1651 | 16.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8349 | |
| 0 | 1651 | 16.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 8349 | |
| 0 | 1651 | 16.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 8349 | |
| 0 | 1651 | 16.5% |
| age | workclass | fnlwgt | education | education-num | marital-status | occupation | relationship | capital-gain | capital-loss | hours-per-week | native-country | race | sex | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.041 | 0.024 | -0.003 | 0.026 | 0.026 | 0.009 | 0.002 | 0.048 | 0.010 | 0.014 | -0.002 | 0.017 | 0.014 | 0.032 |
| workclass | 0.041 | 1.000 | 0.006 | -0.012 | 0.018 | -0.005 | -0.007 | -0.007 | 0.006 | 0.022 | -0.022 | 0.011 | 0.015 | 0.098 | 0.047 |
| fnlwgt | 0.024 | 0.006 | 1.000 | -0.021 | -0.000 | 0.035 | -0.010 | 0.021 | -0.060 | -0.008 | -0.050 | 0.007 | 0.033 | 0.044 | 0.016 |
| education | -0.003 | -0.012 | -0.021 | 1.000 | -0.008 | -0.062 | 0.017 | 0.037 | -0.013 | -0.032 | -0.001 | 0.015 | 0.023 | 0.064 | 0.055 |
| education-num | 0.026 | 0.018 | -0.000 | -0.008 | 1.000 | 0.011 | 0.012 | 0.012 | 0.001 | 0.009 | 0.025 | -0.011 | 0.020 | 0.055 | 0.069 |
| marital-status | 0.026 | -0.005 | 0.035 | -0.062 | 0.011 | 1.000 | -0.027 | 0.025 | 0.014 | 0.027 | -0.027 | -0.023 | 0.020 | 0.127 | 0.038 |
| occupation | 0.009 | -0.007 | -0.010 | 0.017 | 0.012 | -0.027 | 1.000 | 0.003 | -0.029 | 0.020 | 0.042 | 0.001 | 0.028 | 0.039 | 0.066 |
| relationship | 0.002 | -0.007 | 0.021 | 0.037 | 0.012 | 0.025 | 0.003 | 1.000 | -0.044 | -0.054 | -0.000 | 0.018 | 0.014 | 0.163 | 0.115 |
| capital-gain | 0.048 | 0.006 | -0.060 | -0.013 | 0.001 | 0.014 | -0.029 | -0.044 | 1.000 | -0.109 | -0.072 | 0.002 | 0.000 | 0.000 | 0.028 |
| capital-loss | 0.010 | 0.022 | -0.008 | -0.032 | 0.009 | 0.027 | 0.020 | -0.054 | -0.109 | 1.000 | -0.108 | 0.007 | 0.017 | 0.019 | 0.022 |
| hours-per-week | 0.014 | -0.022 | -0.050 | -0.001 | 0.025 | -0.027 | 0.042 | -0.000 | -0.072 | -0.108 | 1.000 | -0.001 | 0.015 | 0.000 | 0.000 |
| native-country | -0.002 | 0.011 | 0.007 | 0.015 | -0.011 | -0.023 | 0.001 | 0.018 | 0.002 | 0.007 | -0.001 | 1.000 | 0.003 | 0.014 | 0.024 |
| race | 0.017 | 0.015 | 0.033 | 0.023 | 0.020 | 0.020 | 0.028 | 0.014 | 0.000 | 0.017 | 0.015 | 0.003 | 1.000 | 0.034 | 0.000 |
| sex | 0.014 | 0.098 | 0.044 | 0.064 | 0.055 | 0.127 | 0.039 | 0.163 | 0.000 | 0.019 | 0.000 | 0.014 | 0.034 | 1.000 | 0.081 |
| target | 0.032 | 0.047 | 0.016 | 0.055 | 0.069 | 0.038 | 0.066 | 0.115 | 0.028 | 0.022 | 0.000 | 0.024 | 0.000 | 0.081 | 1.000 |
| age | workclass | fnlwgt | education | education-num | marital-status | occupation | relationship | race | sex | capital-gain | capital-loss | hours-per-week | native-country | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 38.426234 | 4 | 33512.993587 | 15 | 9.0 | 2 | 3 | 0 | 4 | 1 | 19.712987 | -3.549110 | 39.971673 | 39 | 1 |
| 1 | 36.540765 | 4 | 369555.878603 | 6 | 13.0 | 2 | 5 | 0 | 4 | 0 | 21.398018 | -5.872481 | 17.708007 | 8 | 1 |
| 2 | 24.137102 | 4 | 129336.796145 | 15 | 14.0 | 4 | 7 | 5 | 4 | 0 | 109578.056894 | -7.231288 | 17.868293 | 39 | 1 |
| 3 | 34.264791 | 4 | 146540.074892 | 5 | 13.0 | 4 | 7 | 3 | 4 | 1 | -43.838008 | -2.736450 | 40.120083 | 10 | 1 |
| 4 | 32.977646 | 4 | 242860.280789 | 11 | 5.0 | 4 | 3 | 0 | 4 | 1 | 53.191204 | -1.337627 | 58.353321 | 39 | 1 |
| 5 | 18.376134 | 4 | 317972.383989 | 10 | 14.0 | 0 | 12 | 1 | 2 | 1 | -120.079859 | 3.009470 | 39.769978 | 39 | 1 |
| 6 | 29.288359 | 4 | 8985.132915 | 15 | 14.0 | 0 | 3 | 0 | 1 | 1 | 128.226057 | 3.563708 | 39.726562 | 39 | 0 |
| 7 | 38.867816 | 4 | 198580.377566 | 15 | 12.0 | 2 | 8 | 0 | 4 | 0 | 17.900497 | -5.885805 | 39.890537 | 39 | 1 |
| 8 | 18.561631 | 4 | 111726.802757 | 15 | 10.0 | 2 | 3 | 0 | 2 | 0 | -43.817705 | -3.260357 | 40.251512 | 39 | 1 |
| 9 | 82.639860 | 6 | 144589.336652 | 15 | 4.0 | 4 | 3 | 3 | 4 | 0 | 2416.363291 | -4.141429 | 44.958724 | 39 | 1 |
| age | workclass | fnlwgt | education | education-num | marital-status | occupation | relationship | race | sex | capital-gain | capital-loss | hours-per-week | native-country | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | 33.575988 | 5 | 167157.261000 | 11 | 9.0 | 0 | 6 | 0 | 4 | 0 | -13.033995 | 2.730218 | 40.084302 | 39 | 1 |
| 9991 | 29.746173 | 0 | 166779.623084 | 9 | 10.0 | 2 | 4 | 0 | 4 | 0 | 120.193645 | -7.154164 | 39.907779 | 39 | 1 |
| 9992 | 19.713797 | 4 | 187234.200871 | 9 | 6.0 | 4 | 12 | 0 | 4 | 0 | 48.521430 | -4.902938 | 40.908768 | 39 | 1 |
| 9993 | 31.508190 | 4 | 209595.238300 | 5 | 14.0 | 4 | 7 | 3 | 4 | 1 | 80.091947 | -4.996954 | 39.761219 | 39 | 1 |
| 9994 | 45.324671 | 4 | 160843.487835 | 3 | 9.0 | 5 | 8 | 0 | 4 | 1 | 104.893817 | -2.505869 | 40.129714 | 39 | 0 |
| 9995 | 36.071050 | 4 | 324293.554725 | 11 | 4.0 | 4 | 4 | 0 | 4 | 0 | -56.431328 | 2.001142 | 43.442892 | 39 | 1 |
| 9996 | 26.126315 | 0 | 127335.686497 | 11 | 12.0 | 2 | 6 | 1 | 4 | 1 | 4417.602347 | 1.404273 | 39.889016 | 35 | 1 |
| 9997 | 62.463071 | 4 | 36884.439750 | 11 | 10.0 | 6 | 0 | 0 | 4 | 1 | 32.774824 | 5.491806 | 39.765348 | 39 | 0 |
| 9998 | 42.759673 | 6 | 185867.286046 | 15 | 12.0 | 2 | 1 | 0 | 4 | 1 | 7187.084519 | -2.324548 | 39.761384 | 39 | 0 |
| 9999 | 43.820347 | 6 | 213495.958902 | 15 | 9.0 | 2 | 1 | 0 | 4 | 0 | 71.918633 | -3.084349 | -3.606185 | 39 | 1 |